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Abstract —Polar codes are the first provably capacity-achieving 
forward error correction codes. To improve decoder throughput, 
the symbol-decision SC algorithm makes hard-decision for multi¬ 
ple bits at a time. In this paper, we prove that for polar codes, the 
symbol-decision SC algorithm is better than the bit-decision SC 
algorithm in terms of the frame error rate (FER) performance 
because the symbol-decision SC algorithm performs a local 
maximum likelihood decoding within a symbol. Moreover, the 
bigger the symbol size, the better the FER performance. Finally, 
simulation results over both the additive white Gaussian noise 
channel and the binary erasure channel confirm our theoretical 
analysis. 

Index Terms —Error control codes, polar codes, successive can¬ 
cellation, performance analysis, bit-decision decoding, symbol- 
decision decoding 


I. Introduction 

Polar codes, a groundbreaking discovery by Arikan |1J, 
provably achieve the capacity of any discrete [i 11 and con¬ 
tinuous |2] memoryless channels. Since their debut, a lot of 
efforts have been made to improve the error performance of 
short polar codes. Although a sphere decoding algorithm |3|, 
stack sphere decoding algorithm |4j or a Viterbi algorithm 
0 can provide maximum likelihood (ML) decoding of po¬ 
lar codes, they are considered infeasible due to their high 
complexity. Compared with these ML decoding algorithms, 
the successive cancellation (SC) decoding algorithm [jTJ has 
a lower complexity at the cost of sub-optimal performance. 
Another drawback of the SC algorithm is its long decoding 
latency and low decoding throughput because the SC algorithm 
makes hard bit decisions only one bit at a time. To reduce 
the decoding latency and improve the throughput from a 
perspective of parallel processing, a parallel SC algorithm was 
proposed in |6]| and 0- The symbol-decision SC algorithm in 
|8), which has the same decoding schedule as the parallel SC 
algorithm, makes hard symbol decisions one at a time. In terms 
of error performance, numerical simulation results in |6| and 
[ 71 were used to show that the parallel SC algorithm has no 
performance loss compared with the SC algorithm. There is 
no theoretical analysis in the literature that shows whether the 
parallel and symbol-decision SC algorithms are superior or 
inferior to the SC algorithm (TJ, referred to as the bit-decision 
SC algorithm henceforth. 

In this paper, besides numerical simulations, we prove that 
in terms of frame error rate (FER) performance, the symbol- 
decision SC algorithm is better than the bit-decision SC 
algorithm. Moreover, the bigger the symbol size, the better the 
FER performance. Finally, simulation results over the additive 


white Gaussian noise (AWGN) channel and the binary erasure 
channel (BEC) confirm our theoretical analysis. 

The rest of this paper is organized as follows. In Section [II] 
polar codes are reviewed as well as the bit- and symbol- 
decision SC algorithms. In Section [III] we prove that the 
symbol-decision SC algorithm has a better FER performance 
than the bit-decision SC algorithm. In this section, we also 
show how to make use of future frozen bits within a symbol 
by the symbol-decision SC algorithm. Numerical simulation 
results are presented to confirm our theoretical conclusion as 


well. Finally, some conclusions are provided in Section IV 


II. Bit-Decision and Symbol-Decision SC 
Algorithms for Polar Codes 

A. Polar codes 

For simplicity, we denote (u a ,u a+ i, - ■ ■ ,Ub-i,Ub) as u b a \ 
if a > b, u b a is regarded as void. For any index set A C 
1 = {1,2, ••• ,N}, iDi = (m : 0 < * < N, i € A) is 
the sub-sequence of u = u restricted to A. We denote the 
complement of A in I as A c . 

Suppose N = 2 n , for an (TV, K) polar code, the data bit 
sequence u = is divided into two parts: a AT- element part 
u_4 which carries information bits, and u_4= whose elements 
are predefined frozen bits. For convenience, frozen bits are set 
to zero. 

To generate the corresponding encoded bit sequence x = 

X 1 , 

x = u B N F® n , (1) 

where Bn is the N x N bit-reversal permutation matrix, F = 
[} °], and F® n is the n-th Kronecker power of F [jTJ. 


B. Bit-Decision SC Algorithm for Polar Codes 

When x is transmitted, suppose the received bits are y = 
yf. The bit-decision SC algorithm |1| for an ( N,K) polar 
code estimates the data bit sequence u successively: for j = 
1, 2, • • • , N, itj = 0 if Uj is a frozen bits, otherwise it is 
estimated by 


Uj = argmaxPr(y,u{ x | Uj). (2) 

«j£{0,l} 

Here, the bit-decision SC algorithm makes hard bit decisions 
one bit at a time. 
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C. Symbol-Decision SC Algorithm for Polar Codes 

The M- biQ parallel and symbol-decision SC algorithms 
|6]-|8| make hard-decision for M bits instead of only one 
bit at a time. For 0 < j < jj, the y-th symbol is estimated 
successively by 


fjM+M 
a jM+1 


arg max 


3 


\AM?\ 


Pr(y, u 


jM 

l 


, jM+Ms 
\ u jM+l /! 


( 3 ) 


where TM, = {jM + l,jM + 2, • • • ,jM + M} C T, 
AMj = XMj n A, AM c j = IMj n A c , and \AMf 
represents the cardinality of AMj. If M = N, the M -bit 
symbol-decision SC algorithm is exactly an ML sequence 
decoding algorithm. 


III. Performance Analysis of the Symbol-Decision 
SC Algorithm 

A. FER Analysis of the Symbol-Decision SC Decoding Algo¬ 
rithm 

To have a fair comparison, we assume the symbol-decision 
decoding has the same bit sequence u as its bit-decision coun¬ 
terpart. Without loss of generality, we consider two decoding 
scenarios shown in Fig. |T] In both scenarios, an A'-hit vector 
is divided into f segments. Each segment has M bits. The 
bit-decision SC and M -bit ML decoding algorithms are used 
to decode each segment of scenarios (a) and (b), respectively. 
A box means that a decision is made. From a segment to the 
following segment, both scenarios use the same schedule - 
the successive schedule. Then scenarios (a) and (b) exactly 
correspond to the bit-decision SC and M -bit symbol-decision 
SC algorithms, respectively. Note that when a different bit 
sequence is used for both, all conclusions still apply. 


(a) 


. 

. 

. 

. 

Bits: 

1 2 

M \M+1 

2M\ 

\n-m+ 1 

N 

(b) 

• • - 

— *1H* 

- 

■”tH • - 

. • 

Bits: 

1 2 

M \m+ 1 

2 m\ 

\n-m+1 

N 


Fig. 1. Decoding procedures of (a) a bit-decision SC algorithm and (b) an 
M-bit symbol-decision SC algorithm. 

In terms of the FER performance, we have 

Proposition 1. If all data sequences are independent and 
equally likely, for an (A, K) polar code over any given chan¬ 
nel, the FER of the bit-decision SC algorithm Pr B (uj f uf) 
and the FER of the M-bit symbol-decision SC algorithm 

Pr M (ztj v f uf ) satisfy: 

Pr M (wf f Ui ) < Pr B (uf f uf). (4) 

Proof: Let us calculate the FERs of the two scenar¬ 
ios shown in Fig. 0 Let p 0 = Prsctw^ f uf) and 
p' 0 = Ptml^i* f uf) denote the segment error rate 
of uf f uf by using the SC and ML decoding algo¬ 
rithms, respectively. Similarly, for i = 1, 2, ■ • • , ^ — 1, let 

Although the symbol size M can be any integer no more than N, we 
assume M\N for simplicity. 


Pi = Prsc( 


'iM+1. 


7 ^ u : 


iM +1 


= u\ M ) and p\ = 


Pr 7^ nZXT\n\ M = < M ){1 < i < £) repre¬ 


sent the probabilities of that the «-th segment is erroneously 
decoded by the SC and ML decoding algorithms, respectively, 
provided that all previous segments are correctly decoded. 

Then we have the segment error probability Pr (uf f 
<) = Pr« 7 ^ <| 2 /f)Pr«). Since Pr«) is 
independent of the decoding rule, to minimize Pi :(uf f uf), 
we need to minimize Pr (uf f uf\yf), i.e., to maximize 
Pr« = <|t+). 

Because 

Pi(yf\uf)Pi(uf) 


Pr«| 2 /f) = 


Pr«) 


and nf is a uniformly distributed random variable, the ML 
decoder maximizes Pr (uf = uf\yf). Therefore, we have 


Po >Po- 


( 5 ) 


For any 1 < i < 


Pr( 


iM+M 

iM+1 




■iM+M | 
iJtf+1 


iM+M \ N s-AM 


a iM +1 \yi > 

Pr (v lM+M 
rl \ u iM +1 
iM+M, N 
u iM+1 Wl > u 

Because 


N_ 

M> 

i\ M = U\ M ) = ZyvPfi 

AM\v> r + ,N\ 


the segment error probability 

7^ 


'iM+1 


u\‘" = u\ M )Pr(yf). Hence, to minimize 

/ iM+M 1 -AM _ „AM\ p T (f.%M+M 

r u iM+l I U 1 — U 1 >’ iri \ u iM +1 — 

\ M = u\ M ) need to be maximized. 


Pr(ui%X?\y?,ui M =u[ M ) = 


Pr {y 


N MM 
1 )“1 


iM\.,iM+M 


U. 


iM+1 


)Pr 


iM+M \ 
l iM+1 > 


Pr (yf ,u\ M = u\ M ) 

and ufiff is a uniformly distributed random variable, the 


*iM +1 


AM 


'iM+M 

l iM+l 


ML decoder maximizes Pr( 

). Therefore, we also have 


N 

Pi > Pi for 1 < i < —. 

For the bit-decision SC algorithm. 


iM+M \ N +AM 


iM+1 


I u 


1 ) “1 


( 6 ) 


Pr b{ui = (1 - ft). 

»=0 

For the M-bit symbol-decision SC algorithm, 

w- 1 

p r M (uf fuf) = l- (1 -p[). 

i =0 

According to (|5j and ([6]>, we have 

PrM(wf f uf) < Pr B (wf f uf). 


(a) •..»H».• j V-.4 H*.«H».• 

Bits: 1 M M+l 2M 1 2 M+1 3 M 3M+1 4 M ! I/V -2M+1 N-M N-M+l N 

(b) •.* .• •.g V.>• • • • 

Bits: 1 M M+l 2M 1 2M+1 3 M 3M+1 AM 1 i/V-2/W+l N-M N-M+l N 

Fig. 2. Decoding procedures of (a) an M-bit symbol-decision SC algorithm 
and (b) a 2M-bit symbol-decision SC algorithm. 

Furthermore, we have 
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Fig. 3. Tree graph of a (32, 16) polar code. 


Proposition 2. If all data sequences are independent and 
equally likely, for an (N, K) polar code over any given 
channel, the FER of an M-bit symbol-decision SC algorithm 
Pi'm(u] V 7 ^ wf") and the FER of a 2M-bit symbol-decision 
SC algorithm Pi' 2 m('U] V 7 ^ u^) satisfy: 

Pr 2 M(«f 7 ^ Ui) < Pr M (wf ^ wf )• (7) 

By considering the two scenarios in Fig. [2] Proposition [2] 
can be proved in a similar way as for Proposition [T| 

Therefore, the symbol-decision SC algorithm is no worse 
than the bit-decision SC algorithm in terms of the FER 
performance and bridges the FER performance gap between 
the bit-decision SC algorithm and the ML decoding algorithm. 

B. Message Passing Interpretation 

The SC algorithm can be considered as message passing 
over a tree graph (9). From the perspective of message passing 
over a tree graph, we provide an explanation of the advantage 
of the symbol-decision decoding. To this end, we introduce 
a string vector Si (for 0 < i < jj) to 

represent a data pattern of the v-th M-bit symbol of a polar 
code with length N. If UiM+j is an information bit, S lt j 
is denoted as ’T>’. Otherwise, Sij as ’J 7 ’. Consider a toy 
example of a 4-bit symbol Assuming ua + i and ri 4 i+3 

are information bits, and uu +2 and 11^+4 are frozen bits. Then 
the data pattern of u^_\ is 'VTVT'. Obviously, for an M-bit 
symbol, there are 2 A1 possible data patterns. We divide them 
into two types. The first type is called a DP-I pattern, which 
has no ”D’ or has no ’J 7 ’ after the first ”D\ There are only 
(M + 1) DP-I patterns. The remaining (2 M — M — 1) patterns 
are called DP-II patterns. Henceforth, a symbol which has 
a DP-I (DP-II, respectively) pattern is called a DP-I (DP-II, 
respectively) symbol. 

As pointed out in (T), the bit-decision decoding does not 
take advantage of future frozen bits. That is, when decoding 
information bit m (i € A), the fact that Uj (j E A c and 3 > 
i) is a frozen bit is not accounted for by the bit-decision SC 
algorithm. For the symbol-decision SC algorithm, the future 
frozen bits in future symbols and within a DP-I symbol cannot 
be taken advantage of either. However, the decision rule of the 
symbol-decision SC algorithm can be regarded as a local ML 
decoder. As a result, some information bits can take advantage 
of their future frozen bit(s) within any DP-II symbol. 

We consider a tree graph representation, shown in Fig. [3] 
of a (32, 16) polar code constructed with the method in flO) . 
Nodes on the bottom (from left to right, u\ to u 32 ) are called 
leaf nodes. Each leaf node corresponds to a data bit. There are 
three kinds of nodes in the tree graph. A rate-0 node whose 


descendant leaf nodes are all frozen bits is represented by a 
black node. A rate-1 node whose descendant leaf nodes are all 
information bits is represented by a white node. The rest are 
rate-R nodes in gray. Some descendant leaf nodes of a rate-R 
node are frozen bits, and the others are information bits. We 
consider how to use the knowledge of a frozen bit from the 
perspective of message passing. The knowledge of a frozen 
bit can be passed through only the rate -0 nodes according to 
the encoding of polar codes. 

Given a tree graph and M, data patterns are determined. 
For the tree graph in Fig. [3] all data patterns of M = 2,4, 
and 8 are listed in Table [I] When M = 2 and 4, there are no 
DP-II symbols. 


TABLE I 

Data patterns of the (32, 16) polar code for different Ms. 


M 

DP-I 

DP-H 

2 

TT, TV. VV 

none 

4 

TTTT, TTTV, TVVV. VVVV 

none 

8 

TTTTTTTT. VVVVVVVV 

TTTVTVVV 


Let us take the decoding of U 12 as an example. Although 
U 13 is a frozen bit, this knowledge needs to pass through some 
intermediate nodes a—H>—>c— 7 d—>-e before being received by 
«i 2 if it is to be taken advantage of in the decoding of uyj. 
However, because there is at least one rate-R node in the 
message passing route from U 13 to mi 2 , the decoding of ui 2 
cannot take advantage of the frozen bit U 13 . However, for the 
8 -bit symbol-decision SC algorithm, the DP-II symbol w, 1 /’ 
is decoded as a symbol simultaneously. The frozen bits (itg 1 
and U 13 ) help to decode the information bits (u , 12 and u \|). 
Therefore, unlike the bit-decision SC decoding algorithm, the 
8 -bit symbol-decision SC decoding does take advantage of ii \ 3 
to decode U 12 . If the 2 -bit or 4-bit symbol-decision algorithm 
are used, no future frozen bits can be taken advantage of in 
decoding any information bit because all 2-bit or 4-bit symbols 
are DP-I symbols. In terms of the FER over the BEC, SDSC- 
32 (ML) < SDSC-16 « SDSC -8 < SDSC-4 « SDSC-2 « SC 
(shown in Fig. |4j, where SDSC-i represents the 7 -hit symbol- 
decision SC algorithm and SDSC-32 is also an ML algorithm. 

C. Simulation Results 

Figs. [5] and [ 6 ] show bit error rates (BERs) and FERs of 
symbol-decision SC algorithms with different symbol sizes for 
a (1024, 512) polar code constructed by the method in [ [TO] 
over the AWGN channel and the BEC. Regarding data patterns 
of the (1024, 512) polar code, all 2-bit and 4-bit data symbols 
are DP-I symbols. However, for the SDSC -8 algorithm, 8 
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Fig. 6. Error rates of decoding algorithms for the (1024, 512) polar code 
Fig. 4. Error rates of decoding algorithms for the (32, 16) polar code over over the BEC. 
the BEC. 


of 128 data symbols are DP-II symbols. For the SDSC-16 
algorithm, 12 of 64 data symbols are DP-II symbols. In terms 
of the FER, SDSC-16 < SDSC-8 < SDSC-4 « SDSC-2 « 
SC for the (1024, 512) polar code. The simulation results are 
consistent with Propositions [I] and [2] 

The performance gains are small in our simulation results, 
but these simulation results still reveal how the symbol size 
affects the FER performance of the symbol-decision SC algo¬ 
rithm. If a larger performance gain is expected, the symbol size 
should be increased further. However, for larger symbol sizes, 
we do not provide the simulation results because simulations 
are very time-consuming. 



Fig. 5. Error rates of decoding algorithms for the (1024, 512) polar code 
over the AWGN channel. 

In terms of the BER performance, although we cannot offer 
a rigorous proof, we conjecture that the symbol-decision SC 
algorithm is better than the bit-decision SC algorithm. The 
simulation results in Figs. [5] and [6] are consistent with this 


conjecture. 

IV. Conclusion 

This letter proves that the symbol-decision SC algorithm 
performs better than the bit-decision SC algorithm for polar 
codes in terms of the FER performance. Increasing the sym¬ 
bol size increases the FER performance gain. Therefore, the 
symbol-decision SC algorithm bridges the FER performance 
gap between the bit-decision SC algorithm and the ML de¬ 
coding algorithm for polar codes. 
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